Models for Parallel Computation
نویسنده
چکیده
Parallel computing must meet seemingly contradictory goals: software development should be carried out in an architecture-and technology-independent environment, while both algorithms and system software should take full advantage of the features of the underlying parallel architecture. At the same time, application programmers cannot be expected to be parallel processing experts. Usability, scalability, and portability are thus central issues in high performance computing. A recognized important step towards achieving these goals is the development of a computational model which accurately reeects constraints of existing machines , has broad applicability with respect to existing and future machines, and allows accurate prediction of performance. Such a computational model should provide a bridge between software and hardware and fullll the same function as the von Neumann model for sequential computation. In recent years parallel models have been proposed reened, and evaluated. While no model seems to yet Research supported in part by ARPA under contract DABT63-92-C-0022ONR. The views and conclusions contained in this paper are those of the authors and should not be interpreted as representing oocial policies, expressed or implied, of the U.S. government. fullll the stated goals, signiicant progress has been made. This session contains three papers providing diierent viewpoints on parallel models. In the following we give a brief description of the development of parallel models and the papers appearing in this session. The discussion makes references to various models, but the list is not intended to be exhaustive. The PRAM model, a simple shared memory model, was introduced in 9] and it has proven useful in identifying the inherent parallelism in problems. The PRAM has allowed the development of architecture-independent parallel algorithms at a time when parallel platforms were changing rapidly 15, 17, 22]. Some PRAM algorithms have been successfully translated into eecient distributed memory algorithms. For other, especially ones with highly data-dependent communication patterns, eecient adaptations have proven diicult. The PRAM model is widely considered to not match the essential features of existing parallel machines. The main objections include the lack of being able to account for communication la-tency as well as memory and network contention. The paper by Gibbons argues the important role the PRAM model still plays 11]. He describes a new shared memory model, the Queued Shared Memory Model (QSM) which accounts for limited communication bandwidth by evaluating the maximum memory 1 contention in each phase of the algorithm. Gibbons shows that the QSM model has …
منابع مشابه
Optimization of Agricultural BMPs Using a Parallel Computing Based Multi-Objective Optimization Algorithm
Beneficial Management Practices (BMPs) are important measures for reducing agricultural non-point source (NPS) pollution. However, selection of BMPs for placement in a watershed requires optimizing available resources to maximize possible water quality benefits. Due to its iterative nature, the optimization typically takes a long time to achieve the BMP trade-off results which is not desirable ...
متن کاملParallel computation framework for optimizing trailer routes in bulk transportation
We consider a rich tanker trailer routing problem with stochastic transit times for chemicals and liquid bulk orders. A typical route of the tanker trailer comprises of sourcing a cleaned and prepped trailer from a pre-wash location, pickup and delivery of chemical orders, cleaning the tanker trailer at a post-wash location after order delivery and prepping for the next order. Unlike traditiona...
متن کاملEfficient implementation of low time complexity and pipelined bit-parallel polynomial basis multiplier over binary finite fields
This paper presents two efficient implementations of fast and pipelined bit-parallel polynomial basis multipliers over GF (2m) by irreducible pentanomials and trinomials. The architecture of the first multiplier is based on a parallel and independent computation of powers of the polynomial variable. In the second structure only even powers of the polynomial variable are used. The par...
متن کاملAn Efficient Algorithm for Workspace Generation of Delta Robot
Dimensional synthesis of a parallel robot may be the initial stage of its design process, which is usually carried out based on a required workspace. Since optimization of the links lengths of the robot for the workspace is usually done, the workspace computation process must be run numerous times. Hence, importance of the efficiency of the algorithm and the CPU time of the workspace computatio...
متن کاملFast Finite Element Method Using Multi-Step Mesh Process
This paper introduces a new method for accelerating current sluggish FEM and improving memory demand in FEM problems with high node resolution or bulky structures. Like most of the numerical methods, FEM results to a matrix equation which normally has huge dimension. Breaking the main matrix equation into several smaller size matrices, the solving procedure can be accelerated. For implementing ...
متن کاملPreservation of Stochastic Orderings of Interdependent Series and Parallel Systems by Componentwise Switching to Exponentiated Models
This paper discusses the preservation of some stochastic orders between two interdependent series and parallel systems when the survival and distribution functions of all components switch to the exponentiated model. For the series systems, the likelihood ratio, hazard rate, usual, aging faster, aging intensity, convex transform, star, superadditive and dispersive orderings, and for the paralle...
متن کامل